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ABSTRACT 

^ ' We present the final catalogue of the 2dF QSO Redshift Survey (2QZ), based on 

\ Anglo-Australian Telescope 2dF spectroscopic observations of 44576 colour-selected 

. (u&jr) objects with 18.25 < 5j < 20.85 selected from APM scans of UK Schmidt Tele- 

O ; scope (UKST) photographic plates. The 2QZ comprises 23338 QSOs, 12292 galactic 

stars (including 2071 white dwarfs) and 4558 compact narrow-emission-line galaxies. 
We obtained a reliable spectroscopic identification for 86 per cent of objects observed 
with 2dF. We also report on the 6dF QSO Redshift Survey (6QZ), based on UKST 
6dF observations of 1564 brighter 16 < 6j < 18.25 sources selected from the same pho- 
tographic input catalogue. In total, we identified 322 QSOs spectroscopically in the 
6QZ. The completed 2QZ is, by more than a factor 50, the largest homogeneous QSO 
Q i catalogue ever constructed at these faint limits (6j < 20.85) and high QSO surface 

( ' densities (35 QSOs deg~^). As such it represents an important resource in the study of 

^ \ the Universe at moderate-to-high redshifts. As an example of the results possible with 

d • the 2QZ, we also present our most recent analysis of the optical QSO luminosity func- 

tion and its cosmological evolution with redshift. For a fiat, fim = 0.3 and 51 a = 0.7, 
Universe, we find that a double power law with luminosity evolution that is exponen- 
tial in look-back time, r, of the form L'^^{z) cx e^'^^"^, equivalent to an e-folding time 
\ of 2Gyr, provides an acceptable fit to the redshift dependence of the QSO luminosity 

function over the range 0.4 < z < 2.1 and Mb, < —22.5. Evolution described by a 
quadratic in redshift is also an acceptable fit, with L'^_^{z) cx ioi-392;-o.29^ 

Key words: quasars: general - galaxies: active - galaxies: Seyfert - stars: white 
dwarfs - catalogues - surveys 
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1 INTRODUCTION 

The new generation of QSO surveys provide us with an un- 
paralleled database with which to study the properties of 
the QSO population. In this paper we report on the com- 
pletion of 2dF QSO Redshift Survey (2QZ) and the associ- 
ated 6dF QSO Redshift Survey (6QZ). These surveys pro- 
vide almost 25000 QSOs in a single homogeneous database, 
covering almost five magnitudes (16 < fej < 20.85). When 
combined with QSOs from complementary spectroscopic ob- 
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servations carried out as part of the Sloan Digital Sky Survey 
(SDSS; e.g. York et al. 2000), the number of known QSOs 
has increased more than five- fold in the past five years. Fur- 
thermore, the vast majority of known QSOs are now in a 
few homogeneous samples with well-defined selection crite- 
ria, rather than a highly heterogeneous assemblage of small 
surveys and serendipitous discoveries. The properties of the 
QSO population may now be determined with a new level 
of statistical precision. 

The 2QZ provides a valuable resource to study the 
large-scale structure of the Universe on the largest scales 
over a wide range of redshifts. It can also be used to directly 
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Figure 1. The distribution of measured spectral signal-to-noise per pixel in the -B-band for the best observation of each object in the a) 
2QZ and b) 6QZ catalogues. Note that a small number of sources have S/N < due to residual errors in sky subtraction of faint sources. 



probe the mass distribution via Icnsing studies. In addition, 
the 2QZ also contains significant new catalogues of other 
classes of astronomical objects, e.g. white dwarfs (Venncs et 
al. 2002) and cataclysmic variables (Marsh et al. 2002). 

The 2QZ sample presented in this paper has already 
provided new information on the power spectrum of QSOs 
(Outram et al. 2003), the QSO correlation function (Croom 
et al. 2001a) and QSO spectral properties (Croom et al. 
2002; Corbett et al. 2003). It has also been used to search 
for rare/unusual objects (Londish et al. 2002), carry out 
lensing studies (Meyers et al. 2003; Miller et al. 2003) and 
place limits on cosmological parameters (Outram et al. 2001; 
Hoyle et al. 2002). 

We also provide an updated estimate of the QSO opti- 
cal luminosity function (OLF) and its evolution with redshift 
based on the 2QZ sample in this paper. In this wc include 
a new sample of QSOs obtained from observations obtained 
with the 6dF facility on the UK Schmidt Telescope (UKST). 
The 6dF sample is based on the same input catalogue as the 
2QZ, but extends the coverage of the LF at any given red- 
shift to almost a factor of 100 in luminosity and over 1000 
in space density. This will aid tests of the claimed depar- 
ture from pure luminosity evolution at higher luminosities 
(Hewctt et al. 1993; La Franca & Cristiani 1997). 

Wc describe the 2QZ and 6QZ surveys in Section 2 and 
then discuss their selection effects and completeness in Sec- 
tion 3. In Section 4 we present our analysis of the QSO 
luminosity function. 



given by Smith et al. (2003), and so only a brief overview will 
be given here. The survey area comprises 30 UKST fields, ar- 
ranged in two 75° X 5° declination strips, one passing across 
the South Galactic Cap centred on 5 = —30° (the SGP 
strip) and the other across the North Galactic Cap centred 
on 5 = 0° (referred to in this paper as the equatorial strip, 
but also know as the NGP strip). The SGP strip extends 
from a = 21*^40 to a = 3*^15 and the equatorial strip from a 
= 9^50 to a = 14*^50 (B1950). Note that the survey was orig- 
inally defined in the B1950 coordinate system, although the 
publically available catalogue is presented with J2000 posi- 
tions. The total survey area is 721.6 deg"^, when allowance 
is made for regions of sky excised around bright stars. 

In each UKST field, APM measurements of one bj plate, 
one r plate and up to four u plates/films were used to gen- 
erate a catalogue of stellar objects with 16 < bj < 20.85. 
A sophisticated procedure was devised to ensure catalogue 
homogeneity. Corrections were made for vignetting and field 
effects due to variable desensitization in the UKST plates, 
these effects being particularly noticeable at the edges of 
plates (Smith et al. 2003; Smith 1998; Croom 1997). Can- 
didates were selected for the 2QZ based on fulfilling at 
least one of the following colour criteria: u — bj < —0.36; 
u-b] < 0.12 - 0.8(6,7 - r); 6j - r < 0.05. 

The u — bj limit was tightened to u — bj < — 0.50 for 
the 6QZ sample (bj < 18.25) to reduce the high fraction 
of contamination by Galactic staxs. With these criteria, the 
2QZ input catalogue comprised 47768 candidates and the 
6QZ comprised 1657 candidates (SGP strip only, see below). 



2 DATA 

2.1 Input catalogue 

The selection of the QSO candidates for the 2QZ and 6QZ 
surveys was based on broadband ubjr colours from APM 
measurements of UKST photographic plates. Full details are 



2.2 Spectroscopic observations 

Spectroscopic observations of the input catalogue were made 
with the 2-degree Field (2dF) instrument at the Anglo- 
Australian Telescope (the 2QZ sample) and the 6-degree 
Field (6dF) instrument at the UKST (the 6QZ sample). The 
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Figure 2. The coverage map for the 2QZ catalogue for the equatorial (top) and SGP (bottom) regions. The grey-scale indicates the 
percentage of 2QZ candidates observed over the two survey strips. Regions removed due to bright stars or plate features/defects can be 
seen, as can the unobserved fields at the 14'' end of the equatorial strip. 



2dF instrument is a multi-fibre spectrograph which can ob- 
tain simultaneous spectra of up to 400 objects at once over 
a 2° diameter field of view, and is positioned at the prime 
focus of the Anglo- Australian Telescope. Fibres are roboti- 
cally positioned within the field of view and are fed to two 
identical spectrographs (200 fibres each). Two field plates, 
and a tumbling system allow one field to be observed while 
a second is being configured, reducing down-time between 
fields to a minimum. The spectrographs each contain a Tek- 
tronix 1024 X 1024 CCD with 24 /im pixels. Details of the 
2dF instrument can be found in Lewis et al. (2002). 

The 6dF instrument is a multi-fibre spectroscopic in- 
strument on the UKST with the capability to simultane- 
ously observe up to 150 sources over a 6° diameter field of 
view. Fibres are positioned robotically onto a field plate, 
which is then installed at the focus of the UKST telescope. 
Fibres are fed to a bench mounted spectrograph containing 
an EEV 1032 x 1027 CCD with 13 /um pixels. Further details 
can be found in Watson et al. (2000). 

Based on the need to restrict the magnitude range of 
objects observed by 2dF (to reduce scattered fight) and the 
observed surface density of candidates, objects in the in- 
put catalogue with 18.25 < 6j < 20.85 were observed with 
2dF and those with 16.0 < 6,j < 18.25 were observed with 
6dF. Although the 6QZ sample is approximately 100 times 
smaller than the 2QZ, it is important in extending the cover- 
age of the QSO OLF to almost a factor of 100 in luminosity 
based on a single homogeneous input catalogue. 

Full details of the 2dF spectroscopic observations and 
subsequent catalogue were provided by Croom et al. (2001b) 
at the time of the initial 10k data release. We therefore only 
present a brief description in this paper, updating informa- 
tion where appropriate. 2dF spectroscopic observations be- 
gan in January 1997 and were completed in April 2002. The 
spectral dispersion was 4.3A pixel"^, giving an instrumental 
resolution of 9A. The spectra covered the wavelength range 
3700-7900A. Each 2dF field in the survey was observed for 
3300 - 3600 sees, giving a median signal-to-noise ratio (S/N) 
of 5.0 per pixel in the S-band (4000-4900A), see Fig. la. 
The data were taken in a wide variety of conditions. Under 
conditions of poor seeing (> 2 arcsec) or transparency, expo- 
sure times were increased to compensate for the lower signal 
rates. In the event that conditions prevented any field reach- 
ing its pre-determined completeness level, it was scheduled 
for re-observation (see below). 



The 2QZ input catalogue was merged with that of the 
2dF Galaxy Redshift Survey (2dFGRS; CoUess et al. 2001) 
and a complex tiling algorithm was applied to the resul- 
tant joint catalogue in order to maximize the efficiency with 
which the two decUnations strips could be covered with the 
minimum rmmber of circular 2dF fields. The 2QZ survey 
area is an exact subset of the 2dFGRS area. Where possi- 
ble, when conditions were unsuitable for the more exacting 
requirements of the 2QZ program (e.g. poor seeing, signif- 
icant moonlight), observations of the 2dFGRS only fields 
were carried out. 

6dF observations were performed over the period March 
2001 - September 2002. The spectral range covered was 
3900-7600A using a low dispersion 250B grating which pro- 
vided a dispersion of 286A mm^^ (3.6A pixeP^) and a spec- 
tral resolution of ~ 11. 3A. Typical observation times were 
between 1.5 hours and 2 hours per field. An average of 50 
6QZ objects were observed in each 6dF pointing. A me- 
dian S/N = 16 per pixel was obtained in the continuum 
of 6dF spectra (see Fig. lb). This higher S/N was set by 
our requirement to effectively identify the much larger num- 
bers of contaminating stars in the 6QZ. A number of 6QZ 
targets where also observed in preliminary survey work car- 
ried out between September 1996 and December 1999 with 
the FLAIR spectrograph (Watson & Parker 1994) on the 
UKST prior to the commissioning of 6dF. These FLAIR 
spectra cover the range 3800-7300A using the same 250B 
grating with ~ 11 A resolution, but are generally lower S/N 
than the 6dF data. All but a few (28) QSOs found with 
FLAIR have been re-observed with 6dF. FLAIR spectra are 
not available as part of the publically available database. 
Observations were obtained for candidates in both declina^ 
tion strips. However, the 6dF observations of the equatorial 
strip candidates are quite incomplete (less than 60 per cent 
of the candidates were observed), and so only the SGP strip 
is included in the final catalogue. In total, 25 overlapping 
6dF pointings were used to cover the SGP strip. 

2.3 Catalogue composition 

2.3.1 The 2QZ catalogue 

2dF data were reduced at the telescope using the 2dF data- 
reduction pipeline 2DFDR (Bailey et al. 2003; Bailey & 
Glazebrook 1999) and information on the spectroscopic com- 
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Table 1. 2QZ/6QZ catalogue composition as a function of minimum sector completeness level (see text). The 70, 80 and 90 per cent 
columns give the properties of survey sub-samples which have those spectroscopic completenesses respectively. We list the number of 
objects in each of the different quality classes (Ql, Q2 and Q3). The final row gives the fraction (compared to the total number of 
observed sources) of IDs in each of the specified sub-samples and quality classes. 



2QZ 6QZ 







All 




70 


per cent 




80 


per cent 




90 


per cent 






All 




Class. 


Ql 


Q2 


Q3 


Ql 


Q2 


Q3 


Ql 


Q2 


Q3 


Ql 


Q2 


Q3 


Ql 


Q2 


Q3 


QSO 


22655 


683 




20905 


480 




18068 


312 




11046 


86 




317 


5 




NELG 


4484 


74 




4086 


50 




3458 


31 




2005 


10 




50 







gal 


79 


16 




73 


11 




59 


6 




34 


1 




1 







star 


10904 


1388 




9965 


1007 




8466 


676 




4996 


168 




1148 


4 




cont 


102 


52 




84 


41 




68 


28 




24 


12 




3 


1 




?? 






4139 






2749 






1669 






405 






35 


Total 


38224 


2213 


4139 


35113 


1589 


2749 


30119 


1053 


1669 


18105 


277 


405 


1519 


10 


35 


ID Prac 


0.858 


0.050 


0.093 


0.890 


0.040 


0.070 


0.917 


0.032 


0.051 


0.964 


0.015 


0.022 


0.971 


0.006 


0.022 



pleteness achieved on each field was immediately fed back 
into the catalogue. 2dF fields wliicli failed to achieve a pre- 
determined completeness level (either 70 per cent for the 
2QZ or 90 per cent for the 2dFGRS) were scheduled for re- 
observation. In the end 556 out of 573 fields in the two 2QZ 
declination strips were observed. These included 10 fields 
that were kindly added to the equatorial strip when ob- 
servations of 2QZ (and 2dFGRS) sources were made using 
spare fibres from spectroscopic follow up of the Millennium 
Galaxy Catalogue (Lemon, Driver & Cross 2001). 

The coverage map for the 2dF survey is given in Fig. 
2, showing the percentage of 2QZ targets that we obtained 
spectra for. We failed to obtain observations of a few 2dF 
fields in the most Northerly declination band of the equa- 
torial strip at RAs greater than 14^ and a few fields at the 
very extreme edges/corners of both strips. In total, spectra 
were obtained for 44576 objects (93.3 per cent of the input 
catalogue) in the 2QZ, corresponding to an effective area of 
673.4 deg'^. A small number of objects in the catalogue were 
observed not by 2dF, but by other instruments and/or tele- 
scopes. These are also included in our final catalogue. Ill 
equatorial sources with NVSS (Condon et al. 1998) radio de- 
tections and bj < 20 were observed with the Low- Resolution 
Imaging Spectrograph on the Keck II Telescope (Brotherton 
et al. 1998). Also, 69 targets were observed with either the 
Double-Beam Spectrograph (DBS) on the Siding Spring Ob- 
servatory 2.3m Telescope, or the RGO Spectrograph on the 
AAT, as part of the follow up of close (< 20") QSO candi- 
date pairs (Smith 1998). DBS spectra of 20 objects in the 
6QZ were also obtained. 

Objects without spectra in the 2QZ either lie in the 
fields that were not observed by 2dF or in regions of high 
2QZ/2dFGRS candidate surface density where the tiling 
algorithm was unable to configure a fibre observation effi- 
ciently given the minimum fibre-to-fibre spacing restriction 
(~ SOarcscc). However, we note that 2QZ targets were given 
higher priority in the tiling algorithm than 2dFGRS targets, 
in order not to imprint the stronger angular clustering pat- 
tern of the gala:xies onto the QSO distribution. Due to the 
significant amount of overlap between 2dF fields and the re- 
peat observations of low completeness fields, 10528 objects 
(23.6 per cent) in the 2QZ have more than one spectroscopic 
observation. 

Once reduced, 2QZ spectra were classified using the AU- 



TOZ program (see Croom et al. 2001b) which uses a x^- 
minimization technique to fit each spectrum to a number 
of QSO (BAL and non-BAL), galaxy and stellar templates 
and measure a redshift for all extra-galactic identifications. 
The QSO template was based on the composite spectrum of 
Francis et al. (1991). AUTOZ produces a single identification 
based on the best fit template in one of six categories based 
on the following spectral criteria: 

QSO: Broad (> 1000 km s"^) emission lines. 

NELG: Narrow (< 1000 km s"^) emission lines only. 

gal: Redshifted galaxy absorption features. 

stcir: Stellar absorption features at rest. 

cont: No emission or absorption features (High S/N). 

??: No emission or absorption features (Low S/N). 

All AUTOZ identifications were independently checked 
visually by at least two members of the 2QZ team to correct 

any identifications that were clearly in error. Approximately 
5 per cent of the AUTOZ identifications were changed in this 
fashion. As part of the classification process a quality fiag 
was attached to each identification and redshift measure- 
ment as follows: 

Quality 1: High-quality identification or redshift. 
Quality 2: Poor-quality identification or redshift. 
Quality 3: No identification or redshift assignment. 

The quality flag was determined independently for the 
identification and redshift of an object. For example, a qual- 
ity 1 QSO identification could have a quality 1 or 2 redshift. 

Table 1 gives the breakdown between the numbers iden- 
tified in each class in the 2QZ. Fig. 3 shows the number- 
redshift relation, n{z), for the QSOs and NELGs in the full 
2QZ. The n{z) histograms for both classes of objects are 
relatively smooth, indicating that there are no strong bi- 
ases towards the selection of QSOs/NELGs over specific red- 
shift intervals. The decline in QSO numbers at 2 < 0.4 and 
z > 2.2 are largely due to incompleteness in the colour and 
morphological selection process (see Section 3). We show the 
number counts, n(6,j), for each identification class (omitting 
the small numbers of 'cont' and 'gal' identifications for clar- 
ity) in Fig 4. The surface densities are based on the observed 
counts divided by effective area of the full survey, with no 
further correction for incompleteness. At magnitudes fainter 
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Figure 3. Numbcr-rcdshift histograms for the 2QZ (upper panel) 
and 6QZ (lower panel) showing both QSOs (solid lines) and 
NELGS (dashed lines). The unshaded solid line includes all qual- 
ity 1 and 2 QSOs, while the shaded histograms denote quality 2 
QSO identifications only. Bin widths are dz = 0.025 and dz = 0.25 
for the 2QZ and 6QZ respectively. 



than 6j ~ 18.5 mag the QSOs axe the largest population, 
while at brighter magnitudes, galactic stars dominate. 

The 'cent' identification relates to potential BL Lac 
sources, however this class of object could be contaminated 
by other weak- lined objects (e.g. DC white dwarfs). To in- 
dicate the uncertainty inherent in this classification, in our 
final catalogue all 'cont' IDs have quality 2. A more detailed 
investigation of possible BL Lacs in the survey has been 
carried out by Londish et al. (2002). 

Of the 44576 objects with spectroscopic observations in 
the full 2QZ, 38224 received quality 1 IDs (85.8 per cent) 
2213 quality 2 IDs (5.0 per cent) and 4139 quality 3 IDs (9.3 
per cent). 

We obtained a quantitative assessment of the quality 
flags from the objects with repeat observations. Based on 
the 5073 objects with quality 1 identifications and redshifts 
in two observations; 186 (3.7 per cent) had a different iden- 
tification and different redshift. A further 38 (0.75 per cent) 
objects had a different identification, but the same redshift 
(to within A« = 0.015); arising as a result of a classification 
change from NELG to QSO or vice versa. 2697 objects with 
quality 1 IDs were classified as QSOs in both the observa- 
tions, of these 2589 (96.0 per cent) had the same redshifts. 
Amongst the 615 objects classified as NELGs in both obser- 
vations, 609 (99.0 per cent) were assigned the same redshift. 

Of the 1026 objects with a quality 1 identification and 
redshift, but also a quality 2 identification, 334 (32.6 per 
cent) had a different identification and redshift. 21 (2.0 per 
cent) had a different identification but the same redshift. Of 



Figure 4. Number- magnitude counts for QSO (filled circle), 
NELG (filled triangle). Star (open triangle) and ?? (open circle) 
classifications in the 2QZ and 6QZ. Poisson error bars are shown. 



the 265 (14) objects that were classified as QSOs (NELGs) 
in both observations 185 (12) had the same redshift, or 70 
(86) per cent of the respective populations. 

Overall, we therefore conclude that the quality 1 iden- 
tifications and redshifts are more than 95 per cent reliable, 
while the quality 2 identifications and redshifts are only re- 
liable at the 70 per cent level. 

We also obtained an estimate of the likely composition 
of the quality 3 object catalogue by looking at the 2735 qual- 
ity 3 objects that were subsequently re-observed and given a 
quality 1 identification and redshift. Of these objects, 1611 
(58.9 per cent) were classified as QSOs, 902 (33.0 per cent) 
as stars, 212 (7.8 per cent) as NELGs and 10 (0.4 per cent) as 
gals. This is close to the distribution of classifications in the 
full 2QZ (excluding ?? objects, see Table 1), with a slightly 
higher fraction (4 per cent) of stars and a lower fraction of 
NELGs amongst the unidentified objects. QSOs remain at 
an approximately constant 59 per cent of sources amongst 
both the identified and unidentified populations. 

We can also increase the mean completeness of the 2QZ 
catalogue by limiting the sectors formed from the overlap 
of individual 2dF observations included in the catalogue to 
only those which meet a specified spectroscopic complete- 
ness threshold (see Section 3 below). Table 1 lists the com- 
position of the catalogue based on a number of spectroscopic 
completeness thresholds set at 70 per cent, 80 per cent and 
90 per cent (based on the percentage of quality 1 identifica- 
tions in a given sector). By spectroscopic completeness we 
mean the fraction of spoctroscopically observed sources for 
which we obtained a quality 1 identification. In most analy- 
sis that we have carried out with the 2QZ, we have used the 
70 per cent sector spectroscopic completeness threshold to 
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Figure 5. Redshift compajrison for extra-galactic emission line objects in the 2QZ. a) zi vs. 22 for QSOs. The dashed lines indicate the 
redshift differences caused by the confusion of specific QSO emission lines, b) Relative diflference vs. mean z for QSOs. c) zi vs. 22 for 
NELGs. d) Relative difference vs. mean 2 for NELGs. The quantization seen in b) and d) at low redshift is due to the redshifts being 
determined to a precision of 4 decimal places (~ 30kms~^). In all cases the determined redshift errors are considerably larger than this. 



define the sample. Furthermore, we have only used objects 
with a quality 1 identification. These criteria were chosen 

to give the best compromise between maximizing the sam- 
ple size (over 20000 QSOs) and minimizing spectroscopic 
incompleteness (11 per cent) in the 2QZ. 

The repeat observations also provide a useful method 
to determine redshift accuracy. The RMS pairwise disper- 
sion between redshift measurements for the 2589 QSOs with 
quality 1 redshifts and Az < 0.015 (excluding the objects 
with an incorrect redshift due to line mis-identification) is 
a{Az) — 0.0038; giving a moan redshift error of (j{z) = 
0.0027 for an individual measurement. We note, however, 
that the dispersion increases as a function of redshift, from 
a{z, z < 1) = 0.0022 to a{z, z > 2) = 0.0044. A more z- 
independent estimate of the redshift error is given by the 
fractional error in the redshift a{z)/z = 0.0027, which is ac- 
tually identical to the mean of aiz). A plot of the redshifts 
for the repeat observations with quality 1 identifications and 
redshifts is shown in Fig. 5. From this we can also see that 
the most common mis-identification was between the C IV 
A1549 and Mg II A2798 lines. For NELGs, the RMS disper- 
sion in the redshifts a{Az) = 0.0005, corresponding to a 
mean error in an individual redshift measurement error of 
a{z) = 0.00035. 



2.3.2 The 6QZ catalogue 

An identical procedure was followed to reduce the 6dF data 
and carry out the spectroscopic identification of the 6QZ. In 
total, 6dF spectra were obtained for 1564 of the 1657 colour- 
selected candidates (94.4 per cent) in the SGP strip, giving 
an effective area of 333.0 deg'^. 

The composition of the 6QZ catalogue is given in Ta- 
ble 1. The 6QZ contains a much higher fraction of galac- 
tic stars (73.7 per cent of observed sources) and a corre- 
spondingly lower fraction of QSOs (20.6 per cent) given its 
brighter magnitude limit. The number-redshift relation for 
the QSOs and NELGs in the 6QZ is shown in Fig. 3. The 
n(6j) relation is shown in Fig. 4. We note that the QSO 
n(6j) relation fits smoothly onto that obtained for the 2QZ 
at bj = 18.25, while the stellar n(fej) relation obtained for 
the 6QZ lies approximately a factor 2 lower than expected 
from an extrapolation of the 2QZ stellar counts. This re- 
flects the more restrictive colour cut, {u — bj) < —0.50, used 
to select the 6QZ candidate list. At this cut, the vast ma- 
jority of the stars identified have been scattered into this 
sample by photometric errors. This explains the increase in 
the stellar surface density at the very brightest magnitudes 
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Table 2. Format for the 2QZ catalogue. The format entries are based on the standard FORTRAN format descriptors. 



Field 


Format 


Description 


Name 


a20 


lAU format object name 


RA 


12 12 f5.2 


RA J2000 (hh mm ss.ss) 


Dec 


ali2 i2 f4.1 


Dec J2000 (±dd mm ss.s) 


Catalogue number 


i5 


Internal catalogue object number 


Catalogue name 


alO 


Internal catalogue object name 


Sector 


a25 


Name of the sector this object inhabits 


RA 


12 12 f5.2 


RA B1950 (hh mm ss.ss) 


Dec 


ali2 12 f4.1 


Dec B1950 (±dd mm ss.s) 




13 




XAPM 


f9.2 


APM scan X position {~ 8 fj.m pixels) 


YAPM 


f9.2 


APM scan Y position (~ 8 fim pixels) 


RA 


fll.8 


RA B1950 (radians) 


Dec 


fll.8 


Dec B1950 (radians) 


b} 


f6.3 


fej magnitude 


u-bj 


f7.3 


u — 6j colour 


bj —r 


f7.3 


6,j — r colour [including r upper limits as: (6j — rn^) — 10.0] 


Nobs 


11 


Number of observations 


Observation # 1 








16.4 


Redshift 


qi 


12 


Identification quality x 10 + redshift quality 


IDi 


alO 


Identification 


datei 


a8 


Observation date 


fldi 


14 


2dF field number x 10 + spectrograph number 


fibre 1 


13 


2dF fibre number (in spectrograph) 


S/Ni 


16.2 


Signal-to-noise ratio in 4000-5000A band 


Observation # 2 






Z2 


f6.4 


Redshift 


q2 


12 


Identification quality X 10 -|- redshift quality 


ID2 


alO 


Identification 


date2 


aS 


Observation date 


fld2 


14 


2dF field number X 10 -|- spectrograph number 


fibre2 


13 


2dF fibre number (in spectrograph) 


S/N2 


f6.2 


Signal-to-noise ratio in 4000-4900A band 


Zprev 


f5.3 


Previously known redshift (Vcron-Cetty & Veron 2000) 


radio 


f6.1 


1.4GHz Radio flux, mjy (NVSS) 


x-ray 


f7.4 


X-ray flux, xlO'^^ ergs-^cm-^ (RASS) 


dust 


fr.5 


E{B - V) (Schlegel et al. 1998) 


comment 1 


a20 


Speciflc comments on observation 1 


comment2 


a20 


Speciflc comments on observation 2 



(6j < 17 mag), where the photometric errors on the photo- 
graphic magnitudes increase. 

The 6QZ also has a much higher spectroscopic com- 
pleteness level (over 97 per cent for quality 1 identifications) 
than the 2QZ and so we choose to make no further cuts on 
the basis of sector completeness as we did for the 2QZ. The 
higher completeness levels are directly attributable to the 
much higher mean S/N obtained for the 6dF observations. 

Repeat observations were obtained for 475 objects in 
the 6QZ. Based on a similar comparison of the quality 1 
identifications and rodshifts to that carried out for the 2QZ 
sample, we conclude that the quality 1 identifications in the 
6QZ are 99 per cent accurate (only 2 out of 254 repeat qual- 
ity 1 identifications changed between observations) and the 
quality 1 redshifts are over 96 per cent accurate (only 6 out 
of 177 QSO redshift measurements differed by more than 
Az = 0.015, and of these, only one QSO had Az > 0.03). 

We obtain a redshift error of a{z) = 0.0026 for the 6QZ 
QSOs, identical to that obtained for the 2QZ, based on a 



comparison of the redshifts obtained for the 171 QSOs in the 
6QZ with repeated quality 1 observations with Az < 0.015. 



2.4 Data Products 

The 2QZ and 6QZ catalogues arc available as ASCII files 
from the 2QZ WWW site, http : //www . 2df quasar . org, and 
on a CD-ROM release. The catalogue format is identical for 
both surveys and is given in Table 2. Note that the catalogue 
format has been extended to include more information since 
the preliminary data release (Croom et al. 2001b). The first 
part of a catalogue entry contains details from the input cat- 
alogue, such as position and magnitude. We note that the 
object names may in some cases not correspond exactly to 
the source position in a and 5 as we have improved the as- 
trometry of the catalogue (Smith et al. 2003) since the object 
names were defined. The names have not been changed sub- 
sequently, to avoid confusion with previously published lists. 
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We now include internal catalogue names and numbers, al- 
though objects should generally be referenced by their main 
lAU format name. A new entry is included for the name of 
the sector inhabited by the object. A sector is defined as the 
intersection of overlapping 2dF fields (see Section 3). The 
format of these is e.g. S_200_201_247, where the S denotes 
the SGP strip and the numbers indicate that the sector is 
formed by the overlap of fields 200, 201 and 247. There are no 
sectors defined for the 6QZ. Coordinates are also supplied in 
the B1950 system, as the survey was constructed in this sys- 
tem and the production of completeness masks etc. is more 
straight forward in this system. We also note that sources 
which had only upper limits (i.e. non-dctcctions) on the r 
plates arc also included in the catalogue and have a listed 
fej — r colour. In this case the colour term is (&j — rum) — 10, 
the —10 being used to differentiate upper limits from normal 
colours (objects with real r-band detections have colours in 
the range —1.4 < bj — r < 3.4, while upper limits have 
bj - r < -9.8). 

The input catalogue information is followed by details 
of observations and identifications. As discussed above, in a 
number of cases we have two or more 2dF observations of the 
same source. The first and second observations (arranged in 
quality order) are listed in the main catalogue. These are 
useful to assess the quality of the final catalogue. The iden- 
tification and redshift that we adopt is that from the obser- 
vation with the numerically lowest quality value (where the 
quality value is identification quality x 10 + redshift qual- 
ity), and if there are equal quality values, the highest S/N. 
In all cases the final adopted ID is listed as observation #1, 
with the lower quality observation being listed as observa- 
tion #2. We also list the 2dF field number (fldi and fld2) 
for each observation. These correspond to the actual 2dF 
field number xlO -|- the spectrograph number (1 or 2 for 
2dF, 6 for 6dF). Sources with spectra from the Keck ob- 
servations of 2QZ radio sources all have field number 8881. 
Sources from the foUowup of close pairs have field number 
7771. Sources from the 6QZ with DBS observations have 
field number 6666. We also include cross-matches to selected 
other databases. Where a 2QZ/6QZ source matches the po- 
sition (to within 6") of a previously known QSO/AGN in 
the catalogue of Veron-Cetty & Vcron (2000) we include 
the previously known redshift. We also include radio fiuxes 
at 1.4GHz from the NRAO VLA Sky Survey (NVSS; Con- 
don et al. 1998) and X-ray fiuxes from the ROSAT All Sky 
Survey (RASS; Voges et al. 1999; Voges et al. 2000), convert- 
ing from RASS counts per second to fiux (in ergs~^cm~^) 
by multiplying by 1.2 x 10~^^ . The matching radii used were 
15" and 30" for NVSS and RASS respectively. An estimate 
of the galactic reddening E{B — V) to each source is taken 
from the work of Sclilcgcl, Finkbcincr & Davis (1998). In or- 
der to convert from E{B — V) to extinction values multiply 
by 5.434, 4.035 and 2.673 for Au, Ab^ and Ar respectively. 
Lastly we supply any comments made on the particular ob- 
servation which might fiag the object or spectrum as un- 
usual. 

For the catalogue we have not attempted to sub-divide 
the NELG class into LINERs, Scyfcrts, starbursts, etc. 
Given the finite spectral window of the 2QZ and 6QZ ob- 
servations and the large redshift range observed for NELGs 
{0 < z < 0.6), in many case key diagnostic lines required for 
further spectral classification are missing from the spectra. 



The spectral coverage provided by 2dF and 6dF limits our 
ability to detect broad Mg II emission until z > 0.35. In some 
QSOs with 0.35 < z < 0.5, broad Mg II is clearly detected 
with no corresponding broad H/3 seen. Thus at lower red- 
shifts, 0.15 < 2: < 0.35, some objects classified as NELGs on 
the basis of a narrow H/3 line may exhibit broad Mg II below 
the blue limit of the 2dF spectral window. For QSOs with 
2 > 1.6, AUTOZ fits both 'normal' and BAL QSO templates. 
Objects identified as BALs are indicated by 'QSO(BAL)' in 
the identification column. However, due to the varied nature 
of BAL QSOs, this will not be a comprehensive list of BALs. 
A more detailed search for BALs has not yet been applied. 

Within the objects we classify as stars, in the main we 
only give classifications for white dwarfs (WDs). We have 
not attempted to provide spectral classification of main se- 
quence stars. The ~ 2000 WDs axe mostly DAs with strong 
broad hydrogen Balmer absorption lines. At low S/N these 
can be confused with A stars. Only detailed fitting of the 
temperature and gravity of these stars can easily resolve 
this issue, and this is beyond the scope of the present paper. 
As well as DAs we also find a number of DB and DO WDs 
which are dominated by neutral and singly ionized helium 
respectively. A small number of DZs are also found, with 
broad Ca II H and K AA3969, 3933 absorption. The only 
other stellar types we classify are emission line stars with 
strong hydrogen Balmer emission which we classify as CV 
(cataclysmic variable) and WD + M dwarf binary systems 
which are denoted as DAM, DBM etc. We note that our code 
has been designed primarily to simply identify an object as 
a star (and therefore not a QSO), and that a more detailed 
analysis would no doubt provide more accurate classifica- 
tions. Some analysis of the Galactic stars in the 2QZ has 
already taken place (Venues et al. 2002; Marsh et al. 2002). 
The above classifications axe denoted within the catalogue in 
parentheses after the main star identification [e.g. star(DB), 
star(CV)]. 

FITS format spectra for all objects, including repeat 
observations and spectra from which no identification was 
possible, are provided as a primary data product of the 2QZ 
and 6QZ surveys at http://www.2dfquasar.org/ and on 
the survey CD-ROM. Note that many of the repeat spectra 
are of low quality, as the fields were repeated if they had low 
completeness. A small fraction of spectra have bad 'fringing' 
caused by a damaged fibre. This shows up as a strong oscil- 
lation as a function of wavelength, rendering the underlying 
spectrum urmsablc. In a few cases 2DFDR failed to cope ad- 
equately with the sky subtraction process at either end of 
the CCD datar-frame and so a number of spectra have poorly 
subtracted night sky emission (and in some cases moonlight) 
which has also affected our ability to identify an object in 
some cases. The spectral database does not include spectra 
from FLAIR observations. It also does not include spectra 
from the small number of 2QZ identifications not based on 
2dF spectra (see Section 2.3.1). 

On the WWW site, we also provide 1-arcmin square 
SuperCOSMOS-scanned images (Hambly et al. 2001) of the 
J photographic plates around each object in the 2QZ and 
6QZ catalogue. Due to space restrictions, these images are 
not available on the CD-ROM. The input catalogue is also 
available from this site (Smith et al. 2003). The input cat- 
alogue, spectroscopic catalogue, FITS spectra and photo- 
graphic images define the primary data products of the 2QZ 
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and 6QZ surveys. To aid in the interrogation and analy- 
sis of the catalogue, we also provide a web search tool, 
scripts/code used to generate the survey mask for an ar- 
bitrary minimum sector completeness and the a,5 oi the 
survey boundarios/holos drilled around bright stars in the 
input catalogue. We also provide quantitative estimates of 
catalogue completeness for QSOs, where appropriate, as a 
function of magnitude, redshift and position in 2QZ and 
6QZ catalogues. The empirical methods used to define these 
completeness estimates, arising from a variety of sources are 
discussed in turn in the following section. 



3 SURVEY COMPLETENESS 

We now come to discuss the survey completeness, and 
sources of incompleteness. We define four separate types of 
completeness that can be used to describe the 2QZ and 6QZ 
surveys. These are: 

• Morphological completeness, fni{bj,z). This de- 
scribes our effectiveness at differentiating between point 
sources and extended sources on UKST plates, and also the 
possible effect of the QSO host galaxies on morphology at 
low redshift. 

• Photometric completeness, fp(bj,z). This attempts 
to take into account any QSOs which may have fallen outside 
of our colour selection limits. 

• Coverage completeness (or coverage), fc{6)- We 
define the coverage as the fraction of 2QZ catalogue sources 
which have spectroscopic observations. 

• Spectroscopic completeness, /s(fcj, 2:, 5). This is the 
fraction of 2QZ catalogue objects which have the specified 
quality of spectrum (in our case we consider quality 1 iden- 
tifications). 

We indicate against each type of completeness above 
whether it is considered to be a function of angular posi- 
tion 0, magnitude bj, or redshift z. To provide a general 
description of the angular positional dependence of com- 
pleteness we determine completeness values within each of 
the regions which are described by sets of our overlapping 2° 
diameter fields. Within each of these regions, which we call 
sectors, we may define a coverage and spectroscopic com- 
pleteness value. For ease of use these completeness maps are 
then pixelized on a scale of 1 arcminute. Below we describe 
each of these different completeness measurements and the 
corrections which can be applied to account for them. For 
some analyses it is appropriate to define such completeness 
on an object-by-object basis. 

3.1 Morphological completeness 

Morphological completeness falls into two categories. The 
first concerns objects which are truly point-sources, but 
which have been mis-classified by the APM analysis soft- 
ware. The second concerns QSOs which have significant host 
galaxy contributions, which make their images non-stellar. 
This is an issue at low redshift {z < 0.4). 

The first of these two classes of morphological complete- 
ness is by far the easiest to quantify. A small fraction of stel- 
lar objects are classified as merged images or galaxies by the 
APM on the basis of random errors or overlapping images 



on the photographic plate. The fraction of sources classi- 
fied as non-stellar (/ns) and lost in this fashion has been 
determined by direct comparison with the SDSS Early Data 
Release (Stoughton et al. 2002) imaging data in the equa- 
torial region where the surveys overlap (Smith et al. 2003) 
and is well described by 

/ns = 0.064 + 0.0052(6j - 16). (1) 

/ns thus varies between 0.064 and 0.089 over the full mag- 
nitude range covered by the 2QZ and 6QZ surveys. We see 
no systematic variations in /ns as a function stellar density, 
which varies along the survey strip. 

Since QSO candidates were selected on the basis of 
their stellar appearance on photographic plates, low redshift 
QSOs with detectable host galaxies on the plate will also 
be preferentially de-selected from the final input catalogue. 
The visibility of QSO host galaxies on photographic plates 
is extremely difficult to model; given the large variation in 
host galaxy properties and the large scatter in the rela- 
tion between host and nuclear luminosities (Schade, Boyle & 
Letawsky 2000). However, the 2QZ should be relatively free 
of morphological bias against 'extended' QSOs for z > 0.4, 
at which even the largest host galaxies (to — lOkpc), will 
have angular size of less than 2 arcsec, and therefore appear 
unresolved on the photographic plates with stellar image 
size typically 2-3axcsec. This is further substantiated by 
Hewett, Foltz & Chaffee (1995) who find an excess of ex- 
tended sources in the Large Bright Quasar Survey (LBQS), 
only at « < 0.4. Also, Meyer et al. (2001) find that 4 out of 
71 QSOs found in the Fornax Cluster Spectroscopic Survey 
(FCSS) are extended from APM scans, but that these are 
generally mis-classifications due to nearby bright stars and 
galaxies. It should be noted however, that the LBQS and 
FCSS have brighter fiux limits, bj ~ 18.8 and 6j ~ 19.7 re- 
spectively, than the 2QZ. With brighter nuclei, the effect of 
the host galaxies in determining the observed morphology 
will be reduced. 

We therefore choose not to derive any completeness cor- 
rection for the morphological effect of host galaxies. For 
many applications where the QSO space density is normal- 
ized to a local mean (e.g. clustering) any z-depcndent bias 
(morphological or otherwise) can be corrected as part of nor- 
malization to the observed n{z) relation as long as that bias 
is a smooth, slowly varying function of redshift. 

3.2 Photometric completeness 

Photometric incompleteness is caused by errors in the photo- 
graphic magnitudes (~0.1 mag in each band) and variability 
(due to the non-contemporaneous nature of the ubjr plates 
on each field) which cause QSOs to exhibit u — bj /bj — r 
colours outside our selection criteria. It could also be the 
case that we are missing some fraction of the QSO popu- 
lation that are intrinsically red, however the fraction of red 
QSOs in a 6,7 flux limited sample is likely to be small (Meyer 
et al. 2001). Hewett, Foltz & Chaffee (2001) also find that 
the fraction of objects missed from the blue selected LBQS 
is only 13 ± 4 per cent, using matches to the FIRST radio 
survey (Becker et al. 1995). Given the uncertain nature of 
any red QSO population we chose to base our photometric 
completeness estimates on the likelihood of detecting typi- 
cal blue QSOs. In order to fully quantify the incompleteness 
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Figure 6. The colour rcdshift relations for the 2Q,7j/&Q,Z sample. Top; u — 6j vs. z for all quality 1 QSOs. The dotted line indieatcs the 
?t — 6 J = —0.36 selection limit. Bottom: 6j — r vs. z for quality 1 QSOs. The connected filled circles show the mean colours in Az = 0.1 
bins. The solid line describes the colours derived from the SDSS composite spectrum, zero-pointed to the 2QZ at z = 1 — 2. The dashed 
line describes the mean colours of the Hawkins (priv. comm.) variability selected sample. 
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Figure 7. Colour-colour plots for 2QZ/6QZ QSOs in rcdshift bins 
of Az = 0.5. The solid lines indicate our colour selection limits. 
At low redshift (0.0 < z < 0.5) the QSO colours are redder in 
bj — r, therefore the u — bj cut has greater impact on selection. 
At high redshift (2 > 2) the QSO colours become increasingly 
red, until the predicted centre of the locus is within the region 
excluded by our colour selection limits. 



due to our colour selection we have adopted a semi-empirical 
approach. This allows us to make use of the measured prop- 
erties of the 2QZ QSOs as well as theoretical estimates of 
QSO colours. Wo derive the mean QSO colours in u — 6,1 and 
6j — r as a function of z and bj and use this in combination 
with the measured dispersion of 2QZ QSOs about the mean 
to model the colour distributions. Applying our selection cri- 
teria to these model QSO colours allows us to determine the 
fraction of QSOs that would have been selected. 

Our first step is to derive the mean colour-redshift rela- 
tions for the 2QZ. The u — bj and bj —r colours of 2QZ/6QZ 
QSOs are shown in Fig. 6 as a function of redshift (small 
points), also shown are the mean colours in Az = 0.1 inter- 
vals (connected filled circles). The u—bj vs. z colours show 
slow variations due to various emission lines moving in and 
out of the bands, and an up-turn at z = 2 where Lyman-a 
moves from the it-band to the &j-band, and the tt-band be- 
comes increasingly affected by Lyman-a forest absorption. 
At low redshift {z < 0.5) we also see evidence of a move 
red- wards in the QSO u — bj and 6j — r colours. This is most 
noticeable in the mean bj — r, as sources with redder u — bj 
colours tend not to be selected. Indeed, while at most red- 
shifts there is little evidence of a sharp drop in QSO numbers 
redder than the u — bj cut (dotted line), this is not the case 
at ^; < 0.5 where most QSOs selected are bluer than the 



u — bj cut. This effect is most clearly seen in Fig. 7 which 
displays the u — bj vs. 6j — r colours of 2QZ/6QZ QSOs in 
redshift bins of width Az = 0.5. At z < 0.5 the QSO locus 
is more affected by the u — bj colour cut, due to the redness 
of the b] —r colours. This reddening at low redshift is prob- 
ably due to the increasing amount of contamination from 
the host galaxies of the QSOs, confirmed by the appearance 
of host galaxy features (Ca II H & K) in the spectra of low 
redshift QSOs in the 2QZ (Groom et al. 2002). At high red- 
shift {z ^ 2.5 and greater) mean QSO colours lay within the 
region excluded from our analysis by our colour selection 
limits, and QSOs that are detected are found close to the 
colour selection limits. 

We then compared the 2QZ colours to those derived 
from the SDSS composite QSO spectrum (Vanden Berk et 
al. 2001), taking the SDSS composite and convolving it with 
the u, b} and r band passes (using those specified by Warren 
et al. (1991) and references therein), and then zero-pointing 
the magnitudes to the mean 2QZ colours at z = 1 — 2. The 
required shifts are an addition of 0.16 and —0.08 mag to the 
model u — bj and bj — r colours respectively. We have ex- 
amined the uniformity of the 2QZ colours by comparing the 
median QSO colours in each UKST field. The rms scatter 
found is 0.03 mags, fully consistent with the minimum er- 
rors typically achievable with photographic plates. Detailed 
tests of the procedure used to generate the synthetic colours 
found no significant errors. The exact source of the above 
zero-point offsets in unclear, but as the 2QZ photometry 
is internally consistent and our completeness estimates are 
largely empirical these offsets should not cause any system- 
atic errors in our analysis of the survey completeness. 

We see that the u—bj and bj—r SDSS colours (solid lines 
in Fig. 6) are very close to those of the 2QZ over the redshift 
range z = 0.5 — 2.0. At higher redshift, the SDSS colours are 
redder than those from the 2QZ, as Lyman-a absorption 
progressively reddens the QSO colours. This demonstrates 
that the 2QZ completeness starts to decline past 2: = 2 as we 
preferentially select only the bluest QSOs at these redshifts. 
We also note that the SDSS colours are bluer in both u — bj 
and bj — r than the 2QZ at z < 0.4. We attribute this to the 
decreased effect of the host galaxy in the higher luminosity 
(typically 1 mag) QSOs sampled by the SDSS survey; based 
on the fiat relation between nuclear and host luminosities 
for QSOs (Lqso oc L°Ji~° '^) (Schade et al. 2000; Groom 
et al. 2002). In the SDSS composite this could also be due 
to contamination at low redshift by narrow emission line 
gala^xies (Vanden Berk et al. 2001), £is well as the fact that 
because there are many more QSOs found at high redshift, 
the higher redshift QSOs (with less contribution from the 
host galaxy) will dominate the construction of the composite 
spectrum. 

We also compare the 2QZ/6QZ QSO colours to a sam- 
ple of QSOs which have been selected on the basis of their 
variability (Hawkins & Veron 1995; Hawkins private com- 
munication). We derive the mean u — bj and bj — r colours 
for Hawkins QSOs with bj < 21 as a function of redshift and 
plot them in Fig. 6 (dashed lines). They are zero-pointed to 
the mean 2QZ/6QZ colours at z = 0.7 - 2.0. These shifts 
arc slightly larger than those derived above for the model 
colours based on the SDSS QSO composite and are an addi- 
tion of —0.27 and —0.16 mag to the derived u — bj and bj — r 
colours of the Hawkins sources) , and may be due to a zero- 
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Figure 8. Fitting of the u — 6j vs. 6j — r colour distribution for two example magnitude intervals. Top: 18.45 < bj < 18.55, Bottom: 
20.45 < fe,j < 20.55. From left to right the plots arc: a) colour distributions relative to the mean for QSOs with Icr and 2a fitted contours. 
A'q is the number of QSOs in the plot, b) Grcy-scalc of the same distribution, showing the fitted a values, c) A random realization of 
the fitted model with 10000 points and an estimated completeness value, d) A K-S test between the radial distributions of the QSOs 
and random points. 



point offset in the Hawkins photometry. Once this zero-point 
shift has been accounted for the mean Hawkins colours fol- 
low closely the moan 2QZ/6QZ colours at 0.5 < z < 2.3. At 
higher redshift they become redder than the 2QZ/6QZ, as 
do the SDSS colours. At lower redshift the Hawkins bj — r 
colours arc close to the 2QZ/6QZ colours at z > 0.2, but 
the Hawkins u — bj are significantly redder than those of the 
2QZ/6QZ. As the Hawkins variability selected QSOs are not 
selected on the basis of colour, they should, to first order, 
be independent of colour, and thus follow more closely the 
true colour distribution of the QSO population. Wo do note, 
however, that certain issues can affect variability selection, 
such as host galaxy contamination, the correlation of vari- 
ability with luminosity and redshift (e.g. Trevese & Vagnetti 
2002), and these may introduce some colour dependence to 
any variability selected sample. 

The above considerations suggest that we use the fol- 
lowing to define our mean QSO colour-redshift relations: i) 
the mean Hawkins colours for u — bj at z < 0.5; ii) the mean 
2QZ/6QZ colours for u — b.i at 0.5 < z < 2.0 and 6,j — r at 
0.0 < z < 2.0; iii) the SDSS composite colours a,t z > 2.0 for 
u—bj and bj — r. We note that in each case the SDSS com- 
posite and Hawkins colours have had the above zero-point 
shifts applied. 

A final point to consider is whether there is any mag- 
nitude dependence of the mean colours. In particular, if the 
reddening of the bj — r colours at low z is due to increasing 
host galaxy contamination, this efi^ect might be most notice- 
able in intrinsically faint QSOs. We find this to be the case 



for 2QZ/6QZ bj — r colours. The variation in colour can be 
described by 

A(fe,i - r) = -9.17 + 0.4796,7 + 13.1z - 0.6S5b.jz (2) 

at z < 0.70. At higher redshift no significant effect is seen. 
Wc see no similar reddening of the m — 6j colours towards 
fainter magnitudes for low redshift QSOs in the 2QZ/6QZ, 
although some weak dependence would be expected if it is 
the host galaxy which is producing the variation in 6,t — r 
colours. We have tested the expected A(u — bj) for a given 
A(&j — r) by combining the SDSS QSO composite spectrum 
and model galaxy spectra. For a range of galaxy SEDs (e.g. 
single instantaneous bursts of star formation of age between 
3 and 10 Gyr) the typical A{u — bj) is between ~ 0.2 and 
0.5 times A(6j — r) above z ~ 0.2. As the exact nature of 
this correction is difficult to model due to uncertainties in 
the galaxy SED and the fractional contribution of the host, 
and we do not see any evidence of a magnitude dependence 
of the mean u — bj colours, we have decided not to make 
any correction to the mean u — bj as a function of QSO 
magnitude. We therefore correct our mean colour-redshift 
in 6j — r only by an amount defined from Eq. 2, although 
we limit the 6j — r colours to be no bluer than those from 
the SDSS composite. 

Once the mean colour-redshift relations are defined, we 
then determine the dispersion about the mean. In doing this 
we assumed that the dispersion was a function of magni- 
tude only, a reasonable assumption given that photometric 
errors and variability dominate the scatter. Any redshift de- 
pendence of variability was considered to be a second order 
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Figure 9. The semi-major axis (filled circles) and semi-minor 
axis (open circles) dispersions for the 2D Gaussian distribution 

fitted to the colour-colour distributions as a function of magni- 
tude. The solid lines denote the best fit lines used to derive the 
model QSO colour distributions. At bright magnitudes the dis- 
persions are forced to be no less than the mean dispersion found 
at 6j = 17 - 18. 
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Figure 10. Photometric completeness contours in the (6j , z) 
plane for the 2QZ. The contours are plotted at 5 per cent in- 
tervals: dotted lines denote 5, 15, 25,. ..95 per cent completeness; 
solid lines correspond to 10, 20, 30.. .90 per cent. 



effect. The dispersion about the mean u — bj and bj — r 
colours was fit by a 2D Gaussian distribution in A6j =0.1 
intervals between z — 1 and 2. This rcdshift range was cho- 
sen as the regime where completeness was highest, so as to 
give the most accurate description of the QSO colour dis- 
tribution. Two examples of these fits are shown in Fig. 8. 
Regions outside of our colour selection limits were not in- 
cluded in the fitting procedure. Wc first attempted to fit 
the colour distribution with four parameters: peak height, 
semi-major axis (cTa), semi-minor axis ((Tb) and position an- 
gle. However investigation showed that in all cases the po- 
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Figure 11. The ratio of close pairs found in the 2QZ input cat- 
alogue, NNc to those observed with 2dF NN^^^^. At scales less 
that ^ 1 — 2 aJTcmin there is a decrease in the number of pairs due 
to the positioning constrains of the 2dF fibre positioner. The con- 
stant offset below 1 at larger angular scales is cause by the lower 
mean density of observed objects compared to the full catalogue 
(i.e. not all objects could be observed). 

sition angle was close to 45°. The fitting process was then 
repeated fixing the position angle to be 45°. Random dis- 
tributions following the fitted Gaussian were produced to 
confirm that the 2D Gaussian was a reasonable fit to the 
data via a radial Kolmogorov-Smirnov (K-S) test. Only two 
A6j = 0.1 intervals showed a significant difference between 
the data and the model. In general we note that the model 
does generate marginally less extreme outliers in the colour 
distributions, however this does not significantly affect the 
determined completeness values. The measured widths of 
the 2D Gaussian model were fitted by a straight line in the 
range 17 < 6j < 20.85, and these fits used to determine 
the dispersion in our final Monte-Carlo simulations that es- 
timate QSO completeness. At 6,j < 17 there are insufficient 
QSOs to make an adequate fit to their distribution, so the 
dispersions are fixed to be the mean of the dispersion in the 
6j = 17 - 18 range (see Fig. 9). 

Finally we make Monte-Carlo simulations of the QSO 
colour distributions in A6j = 0.1 and Az = 0.1 bins us- 
ing 100000 QSOs in each interval. The fraction of simulated 
QSOs which fall outside our colour selection limits then de- 
fine our incompleteness. The final photometric completeness 
contours are shown in Fig. 10. The photometric complete- 
ness is largely independent of magnitude and is at least 70 
per cent or greater over the rcdshift range 0.4 < z < 2.2. At 
higher redshifts the completeness rapidly drops, falling to 
below 50 per cent at « > 2.3. The bluer u — bj limit imposed 
on the 6QZ {u — bj < —0.5 rather than u — bj < —0.36) 
causes a weak stop in the completeness contours which can 
be seen at bj = 18.25. 

3.3 Coverage completeness 

Of all four types of completeness, at a sector level the cov- 
erage completeness (or coverage) is the simplest to derive, 
being only a function of angular position. We define the cov- 
erage as being the ratio of observed to unobserved sources 
in each of the sectors defined by overlapping 2QZ fields, 
we denote this by fc{0). The coverage maps (pixelized on 1 
arcminute scales) of the two 2QZ strips are shown in Fig. 
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Figure 12. The completeness map of the 2QZ catalogue for the equatorial (top) and SGP (bottom) regions. The grey-scale indicates the 
percentage of 2QZ candidates observed and positively identified (quality 1) over the two survey strips. This is equivalent to fc{6)fs(6)- 



2. These maps also account for holes in the survey regions 
which are due to bright stars or photographic plate de- 
fects/features. The incompleteness in coverage arises from 
the difficulty of configuring all targets onto 2dF fibres us- 
ing the adaptive tiling scheme (Colless et al. 2001). This 
is caused by variations in target density, a small (typically 
~ 1 per cent) number of broken or unusable 2dF fibres, and 
constraints due to the minimum fibre spacing (~ 30 arcsec). 
These issues are in large part alleviated by the large overlaps 
between 2dF fields, however it was still not possible to ob- 
serve 100 per cent of the QSO candidates in the 2QZ. The 
remaining decline in the number of close pairs is demon- 
strated in Fig. 11 which shows the ratio of close pairs from 
observed (NNobs) to those in the input catalogue (NNc). 
This shows a decline in close pairs at angular scales less 
than ~ 1 — 2 arcmin. This has no significant efi^ect on the 
clustering analysis of QSOs (e.g. see Groom et al. 2001a). 
The number of close pairs contained within the input cata- 
logue is also limited by the resolution available on the APM 
scans of our UKST plate material. This reduces the number 
of close pairs on angular scales less than ~ 8 arcsec (Smith 
et al. 2003). 

3.4 Spectroscopic completeness 

3.4-1 Position dependent spectroscopic completeness, 

fsie) 

Spectroscopic completeness is perhaps the most complex of 
all completenesses to describe. In our discussion below we 
focus on the spectroscopic completeness of the QSOs in our 
survey, as they are the primary targets of our observations. 
In general different classes of object will have different spec- 
troscopic completeness functions, e.g. NELGs are more eas- 
ily identified in low S/N spectra than stars. Our assessment 
of QSO spectroscopic incompleteness assumes that there 
are equal fractions of QSOs among both the identified and 
unidentified spectra (see Section 2.3.1). Alternative meth- 
ods may be required to assess the completeness of different 
object classes. 

The distribution of combined coverage and spectro- 
scopic incompleteness, /c(^)/s(^), across the two 2QZ strips 
based on quality 1 IDs alone is shown in Fig. 12 (c.f. Fig. 
2 which shows the coverage incompleteness only). However, 
this only describes the mean spectroscopic completeness in 
each sector (i.e. as a function of angular position). This 



is straightforward to calculate on a sector-by-sector basis, 
based on the ratio Ni/Nobs, where Ni is the number of 
quality 1 IDs in a sector and A'obs is the number of targets 
observed in a sector. By contrast, fc is determined by the 
ratio Nohs/Nc, where A'c is the total number of 2QZ sources 
within the sector. However, while coverage is only dependent 
on angular position, spectroscopic completeness will in gen- 
eral also be dependent on bj and z. For many analysis (e.g. 
the luminosity function, see below), we only require global 
magnitude and redshift dependent corrections. These will 
vary with the minimum sector completeness level applied 
to produce any 'complete' spectroscopic catalogue (see Sec- 
tion 2.1). For example the inclusion of more low mean com- 
pleteness sectors will produce a spectroscopic completeness 
function with a more marked dependence on bj . 



3.4.2 Magnitude dependent spectroscopic completeness, 

Although for many purposes a global magnitude dependent 
completeness correction would suffice, it is also possible to 
determine the magnitude dependence of spectroscopic com- 
pleteness on a sector-by-sector basis. In Fig. 13 we plot the 
spectroscopic completeness as a function of 6j for sectors 
with different mean spectroscopic completeness levels. In 
sectors of high completeness there is very little dependence 
of completeness on 6j, while for less complete sectors (typ- 
ically observed in worse conditions) the completeness has a 
stronger dependence on bj. Following a similar approach to 
Colless et al. (2001) we parameterize this magnitude depen- 
dence as a function of the form 

/s(b.,) = l-exp(6.,-M), (3) 

where p is the single free parameter. Fig. 13 also shows the 
best fit function for each mean completeness interval. This 
simple functional form traces well the dependence of com- 
pleteness on bj magnitude. We can then relate the best fit n 
values to the mean completeness < fs{d) >. This is plotted 
in Fig. 14, and can be fit by the function 

A + Bln{l- < Me) >), (4) 

where A and B are constants to be fit for. Their best fit 
values are 20.388 ± 0.77 and -0.919 ± 0.052 respectively. 
The magnitude dependence of fs is then fully described by 
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Figure 13. Spectroscopic completeness as a function of 6j magni- 
tude, /s(fej), for sectors with different mean completeness values, 
fs{9) (filled circles). These arc plotted for mean sector complete- 
ness intervals 0.98-1.00, 0.95-0.98, 0.90-0.95, 0.85-0.90, 0.80- 
0.85, 0.75-0.80, 0.70-0.75 and 0.65-0.70. In each case the best fit 
magnitude dependent completeness model is plotted (solid line), 
and the best fit value of n is shown. 



/.(6j) = l- 



exp(6j - A) 



(5) 



[1 - fsm" ■ 

We are then able to combine this with the mean spectro- 
scopic completeness within a sector, fs{0), to derive the full 
position and magnitude dependent spectroscopic complete- 
ness correction, 



(6) 



The value Nest (9) is an estimate of the number of sources 
with quality 1 IDs given the function /s(6j) for a particular 
sector, and is given by 



(7) 



Therefore in order to derive the positional and magnitude 
dependence of the spectroscopic completeness of the 2QZ 
we require the mean /s value in each sector, fs{d), and the 
derived A^ost (0) value for each sector. These are provided as 
part of the 2QZ data release. 

The simple functional form for /s(fc,j) in Eq. 3 can 
be fit to the full 2QZ catalogue, and this results in /i = 
22.168 ± 0.026. Given the relation between fs{0) and /j de- 
rived above, the overall spectroscopic incompleteness of 85.7 
per cent would imply a value of fj, = 22.18, in excellent 
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Figure 14. The magnitude dcijciidcut sijcctroscopic complete- 
ness parameter as a function of mean sector spectroscopic com- 
pleteness, < /s(S) > (solid points). These are the best fit parame- 
ters for the fits shown in Fig. 13. The solid line shows the function 
used to describe the dependence of on < fa{d) >■ 
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Figure 15. The area of the 2QZ as a function of mean sector 
spectroscopic completeness, fs{d)- 



agreement with the directly fitted value. The 2QZ catalogue 
limited to sectors with at least 70 per cent quality 1 iden- 
tifications gives /i = 22.407 ± 0.037. This 70 per cent cat- 
alogue, based on Table 1, provides a suitable compromise 
between the need to achieve relatively high level of mean 
overall spectroscopic completeness (89 per cent), whilst re- 
taining as many of the original quality 1 QSOs in the final 
catalogue (20905) as possible (sec Fig. 15). 

The 6QZ sample has a mean spectroscopic complete- 
ness of 97.1 per cent, considerably higher than that of the 
2QZ. This mean correction should be sufficient to account 
for spectroscopic incompleteness, as there is only weak (un- 
significant) magnitude dependence on completeness in the 
6QZ. However, if we do apply a similar analysis to that 
described above, and fit Eq. 3 to the 6QZ over the range 
16 < 6j < 18.25 we find = 21.041J (see Fig 16). This 
can also be used to describe magnitude dependent spectro- 
scopic completeness in the 6QZ. Note that the relation be- 
tween fj, and /b (9) derived above for the 2QZ is not valid for 
the 6QZ. 
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Figure 16. Spectroscopic coiriijlctciicss as a function of 6j mag- 
nitude, /s(6,j), for the 6QZ (iilled circles). Also shown is the best 
fitting model to the magnitude dependent completeness (solid 
line). 



3.4-3 Redshifl dependent spectroscopic completeness, /s(«) 

There is also the possibihty that the spectroscopic identifica- 
tion procedure has also introduced subtle redshift-depcndent 
biases. This can happen in low S/N spectra, particularly 
when only one strong QSO emission line is visible in the ob- 
served spectral window. On inspection of the smooth 2QZ 
n{z) relation (Fig. 3), we infer that any sudden changes in 
the efficiency of redshift determination nmst be small, at 
least over the 0.4 < z < 2.2 redshift range, where the pho- 
tometric completeness is also high (> 70 per cent). The red- 
shifts at which strong emission lines move into the observed 
frame (e.g. C III] A1909 at z = 1.00, C IV at « = 1.45 or Lya 
aX z = 2.12) are not marked by a significant increase in the 
n(«). At z > 0.35 there is a significant discontinuity in the 
n{z), most likely caused by the appearance of Mg II emission 
line in the observed spectral window. At these redshifts, in 
many cases wc note that objects classified as a QSO by the 
appearance of a broad Mg II emission line often do not ex- 
hibit significant visible broad H/3 or H7 emission lines. We 
conclude, therefore, that there is likely to be a significant 
number of objects aX z < 0.35 which we have failed to clas- 
sify as QSOs based only on the appearance of the Balmer 
series lines. 

In order to perform quantitative tests on the redshift 

dependence of spectroscopic completeness we carry out K-S 
tests comparing the redshift distributions of QSOs selected 
in sectors of different completeness. Wc first compare QSOs 
from sectors of 60-70%, 70-80%, 80-90% and 90-100% to 
the n{z) distribution of all QSOs. The derived probabilities 
of the null hypothesis that the n{z) distributions are the 
same are 0.318, 0.193, 0.853 and 0.133 respectively. There 
is therefore no significant (> 1.5cr) difference between these 
n{z) distributions. We secondly compare the n{z) distribu- 
tions for the lower completeness QSO samples, 60-70%, 70- 
80% and 80-90%, to the distribution for QSOs with 90- 
100% completeness. The resulting probabilities of the null 
hypothesis in this case are 0.182, 0.007 and 0.040 respec- 
tively. This does therefore indicate some small but signifi- 
cant differences. The lower completeness samples are gen- 
erally biased such that they have slightly higher numbers 
of high redshift QSOs. This is suggestive that the greater 
number of strong emission lines at high redshift is making 
it easier to identify lower S/N QSO spectra. 




Figure 17. The redshift dependence of spectroscopic complete- 
ness, a) The observed redshift distribution of all 2QZ QSOs, 
n{z)ai\ (solid line), compared to the redshift distribution of ob- 
jects with low quality identifications which were subsequently re- 
observed and given a high quality QSO redshift, Ti(^)bad (dotted 
line). n(z)aii is renormalized to contain the same total number of 
QSOs as n(z)bad- Wc also show the predicted n(z) assuming that 
the incompleteness in the 2QZ is distributed as n(2;)bad (filled 
points), b) The ratio of r»(2)aii to n(z)bad (renormalized) as a 
function of redshift. Error bars are Poisson. 



Finally we attempt to derive the completeness correc- 
tions required to correct for these small variations in com- 
pleteness as a function of redshift. If there were no sys- 
tematic variations of completeness with redshift, the total 
fractional spectroscopic completeness, /s, would simply be 
/s = fs{0,bi). However, this will be modulated by any red- 
shift dependent spectroscopic incompleteness. In order to 
determine this modulation of the redshift distribution we 
turn to the repeat observations. There are 2473 QSOs which 
have one spectrum of quality 11 and a second of lower quality 
(quality 22 or worse, including objects with ?? IDs). We can 
then use these objects to look at the redshift distribution of 
unidentified objects. In Fig. 17a we plot the redshift distribu- 
tion of all quality 11 QSOs, {n{z)ai\, solid line) and the red- 
shift distribution for those objects which had both low and 
high quality identifications in repeat observations (n(0)bad, 
dotted line). In both cases the redshift used is that from the 
best quality ID (quality 11) which is treated as the "true" 
redshift. There is a clear and significant difference between 
the two (note that n(z)aii has been rcuormalizcd) . Wo see 
that a larger fraction of bad IDs are present at 2; ~ 0.6 and 
z ~ 1.4, while there are fewer bad IDs at z > 1.8. These ap- 
pear to be due to changes in the visibility of strong emission 
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Table 3. Derived values for R{z) (and errors, (Jik^z)) that can be 
used to estimate the redshift dependence of spectroscopic incom- 
pleteness. Data are binned in Az = 0.1 bins. 
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Figure 18. The observed spectroscopic completeness within 2dF 
fields as a function of radius from the field centre. We plot the 
completeness for specific observations in a given field (filled cir- 
cles), and the completeness for the best observation of a given 
object including overlapping fields (open circles). Also plotted is 
the mean completeness from the best observations (dotted line) . 



lines as a function of redshift. The ratio of n(z)aii to n{z)had 
is shown in Fig. 17b and more clearly demonstrates the de- 
pendence of completeness on redshift. We then determine 
the relative effect that incompleteness of this type would 
have on the observed redshift distribution of the QSOs. In 
the full 2QZ sample 14.2% of sources have quality poorer 
than quality class 1. If this fraction is distributed in redshift 
as n(z)bad and added onto the observed n(a)aii we obtain 
the filled points in Fig. 17a (also assuming that the fraction 
of QSOs in the unidentified sources is equal to the fraction 
of QSOs in the identified sources, see Section 2.3.1). We see 
that making the correction changes only slightly the overall 
shape of the derived QSO n{z). 

In order to make the above corrections to fs , we define 
a function R{z) such that 

n(«)aii 



(8) 



n(a)bad ' 

where both n(«)aii and n{z)had are normalized such that the 
sum of all redshift bins is equal to one. This ratio is tabulated 
in Table 3. The completeness is a given sector and bj interval 
is then distributed according to this relation such that 

-1 



= 1 + 



1 



fs{9,bj) 



- 1 



(9) 



This relation can then be used to describe the variations 
in completeness in a general sense for the 2QZ. We note 
that several assumptions have been made in our estimate 
of redshift spectroscopic incompleteness. In particular, we 

assume that redshift dependence is separable from fs{0. b,^). 
That is, the form of R{z) does not vary depending on sector 
or 6j magnitude. 

3.5 Completeness within 2dF fields 

AH the above analysis of completeness is based on the as- 
sumption that within a single 2dF field, or more exactly, a 
single sector, the survey completeness is uniform. However, 
various effects could cause systematic variations within these 
regions. E.g. Groom et al. (2001) showed that there is a sys- 
tematic variation in the effective throughput across a single 



2dF field (sec their Fig. 2). After further investigation of 
this effect, it was found to be due to systematic residuals 
remaining from sky subtraction, caused by the spectral PSF 
of the spectrograph degrading slightly near the edge of the 
CCDs (e.g. Willis, Hewett & Warren 2001). This was due to 
the 2DFDR software effectively basing the throughput esti- 
mate of fibres on the peak fiux in sky lines rather than the 
total fiux in the lines. As a result, routines to measure fibre 
throughput from total sky line flux were written and incor- 
porated into the 2DFDR pipeline. These routines removed 
the majority of the residual sky emission, however it should 
be noted that some excess sky residuals will still remain 
due to subtraction of a mean sky spectrum which does not 
have exactly the same spectral PSF as the data (see Willis 
et al. 2001). Also, the revised version of 2DFDR could not 
be run on data taken before July 2001, so that in the final 
2QZ catalogue completeness variations as a function of fibre 
number remain. As any given fibre can be positioned over 
a large fraction of the 2dF field-of-view this incompleteness 
as a function of fibre number does not have a significant 
effect on the spatial distribution of completeness, and does 
not impact on analysis such as the study of QSO clustering. 

Due to a number of different effects, the completeness 
near the edge of a 2dF field could be lower than in the central 
region. We therefore look at the completeness within a 2dF 
field as a function of radius from the field centre. We first 
take each individual observation of a 2dF field and find the 
spectroscopic completeness as a function of radius from the 
field centre. In Fig. 18 we plot the number weighted average 
of these completeness estimates (filled circles). A decline in 
completeness with increasing radius is visible at > 0.4° , with 
regions at > 0.9° on average only ~ 65 per cent complete. 
This could be caused by a number of effects. Systematic er- 
rors in astrometry or field rotation would tend to be worse 
towards the edge of the 2dF field. Similarly, atmospheric re- 
fraction effects, if a field was observed at a different hour 
angle than it was configured for, would also be most notice- 
able at the edges of the field. In order to see how this radially 
dependent completeness affects the final 2QZ catalogue, we 
then derive the radially dependent completeness, using only 
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Figure 19. (M^j , z) distribution for 2QZ (points) and 6QZ (tri- 
angles) QSOs. Dashed lines indicate the limits of our maximum 
UkeUhood analysis, < —22.5 and OA < z < 2.1. For this plot 
we assume Hm = 0.3, Ha = 0.7 and Hq = 70kms~^Mpc~^. 



the best observation of each object, and include objects that 
arc within a given field, but actually observed in an overlap- 
ping field. This radial completeness estimate is also shown in 
Fig. 18 (open circles). When using only the best observations 
and including overlaps, we find that there is no significant 
decline in the completeness at large radii. This suggests that 
the final 2QZ catalogue should not be seriously affected by 
radially variable incompleteness. The most extreme test of 
the catalogue uniformity is in clustering analysis. We have 
performed detailed simulations of the radial completeness 
variations and their effect on the two-point correlation func- 
tion, and eoiifiriii that these radial variations have negligi- 
ble impact on the clustering analysis (Croom et al. 2003 
in preparation). These tests also imply that any variation 
in completeness as a function of fibre number do not effect 
measurements of clustering. 



4 THE OPTICAL QSO LUMINOSITY 
FUNCTION 

4.1 Data 

Based on the discussion above, we chose to use the 2QZ 
sample defined by a minimum mean sector spectroscopic 
completeness of 70 per cent, and the full SGP declination 
strip for the 6QZ. The effective survey area corresponds to 



595.9 dcg^ for the 2QZ and 333.0 deg^ for the 6QZ after 
coverage correction. All QSO magnitudes and sector mag- 
nitude limits were corrected for galactic extinction on the 
basis of the mean extinction values (Schlegel et al. 1998) in 

each sector. 

A strength of the combined 2QZ/6QZ data set is its 
homogeneous selection and photometry. With the statistical 
precision that the 2QZ/6QZ cam provide, errors in the esti- 
mate of the QSO OLF are largely dominated by systematic 
effects (apart from possibly the very brightest luminosities). 
Therefore, although it is possible to combine other surveys, 
in the analysis below, we chose not to include QSOs from 
any other catalogue. We also limit the analysis to regions of 
the catalogue where the photometric completeness is greater 
than 85 per cent (see Fig. 10), corresponding to a redshift 
range 0.4 < z < 2.1. In Fig. 19 we show the z vs. Mbj 
distribution of sources, indicating the above cuts. 

Within this redshift range, the corrections for photo- 
metric, morphological and spectroscopic completenesses de- 
rived above were then applied to this combined dataset. To 
minimize the noise on the estimate of the redshift-dependent 
spectroscopic incompleteness the estimates presented in Ta- 
ble 3 were block-averaged over Az = 0.3 bins. 

4.2 1/V estimator 

As in Paper I, we first obtain a graphical binned represen- 
tation of the OLF and its evolution with redshift using the 
1/V estimator devised by Page & Carrera (2000). We chose 
to bin the OLF into six equally spaced linear redshift bins 
over the range 0.4 < z < 2.1 and into AMb, — 0.5 bins in 
absolute magnitude. All bins that contain at least one QSO 
are plotted. However, it should be noted that bins near the 
magnitude limits of the survey can be biased even using the 
Page & Carrera estimator (see Paper I). This is the case if 
part of the bin lies outside of the survey limits, and there is 
significant evolution across the bin. 

The resulting OLF, calculated for a flat universe with 
= 0.3, Qa = 0.7 and Ho = 70kms"^Mpc"^ (which we 
call the A cosmology) is plotted in Fig. 20. The value of the 
OLF, statistical error and the number and mean redshift 
of QSOs contributing to each plotted point in the figure is 
given in Table 4. We determine upper and lower la (84.13 
per cent one-sided) confidence intervals for a Poisson dis- 
tribution (Gehrels 1986) for bins which have small numbers 
of QSOs (< 20), and hence have errors which are not well 
represented by y/N errors. 

4.3 Mctximum likelihood analysis 

To obtain a more quantitative descriptions of suitable mod- 
els we carried out a maximum likelihood analysis, fitting a 
number of models to the data (e.g. Boyle, Shanks & Peter- 
son 1988). This technique relies on maximizing the likelihood 
function S corresponding to the Poisson probability distri- 
bution function for both model and data (Marshall et al. 
1983). Previously, we have used the 2D K-S statistic to pro- 
vide a goodness-of-fit measure for 'best fit' maximum likeli- 
hood models. However, the data-set is now so large that the 
K-S test, notoriously insensitive to discrepancies between 
the data and the model predictions in the wings of the dis- 
tributions, is very poor at discriminating between models. 
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Figure 20. The optical QSO luminosity function for the 2QZ + 6QZ sample in six redshift intervals (open and filled points). The dotted 
lines denote the predictions of the best fit double power law exponential evolution model to dataset with M^j < —22.5 in the A Universe. 
The model values are calculated based on the accessible volume in each bin, exactly as the data, and so contain an explicit correction 
for incomplete bins. This means that the dotted lines may not appear to follow an exact power law. The vertical dashed line indicates 
the < —22.5 limit used in the maximum likelihood fitting. 



Instead, we choose to use the statistic, based on the num- 
bers of observed and predicted QSOs in each of the Mb,, z 
bins used for the 1/V estimator above. 

In keeping with our general philosophy of previous max- 
imum likelihood fits to the OLF, we used the minimum num- 
bers of free parameters in any model required to obtain an 
acceptable fit. Our a priori definition of an acceptable fit 
was one which could not be rejected at the 99 per cent confi- 
dence level or greater based on the statistic. Errors on the 
fit parameters correspond to the AS' = 1 contours around 
each parameter, or, equivalently, the 68 per cent confidence 
contour for one interesting parameter. 

While carrying out the maximum likelihood analysis, we 



employ an absolute magnitude limit A/b, > —22.5 for the A 
cosmology and A/b, > —22 for an Einstein-de Sitter (EdS) 
cosmology {Q,yn = 1-0, = 0.0 and Ho = 70kms"^Mpc"^). 
This absolute magnitude cut excludes AGN where the con- 
tribution from the host galaxy (either photometrically or 
spectroscopically) may lead to a significantly bias against 
the inclusion or identification of such objects in the 2QZ 
which is not accurately modelled by the above completeness 
analysis. The calculated absolute magnitude of these sources 
would also be biased by the inclusion of fiux from the host 
galaxy. 
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Table 4. Binned luminosity function estimate for Hni = 0.3, Ha = 0.7 and Hq = 70kms~^Mpc~^, as plotted in Fig. 20. We give the 
value of log ^> in 6 redshift intervals, and in AM^j = 0.5 mag bins. We also list the mean redshift (z) in each bin, the number of QSOs 
contributing to the OLF (ATq) and the lower and upper errors (Alog$). 
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Table 5. The best fit OLF model parameters from the maximum likelihood analysis for the range of models discussed. We list the 
redshift range and faint limit of the data fitted, the evolution model and cosmology used (assuming a fiat Om + f^A = 1 Universe), 

the number of QSOs in the analysis (Nq) and the best fit values of the model parameters. We also give the value for the comparison 
of the model to the data, as well as the number of degrees of freedom (u) and the probability. 
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4.4 Model fitting 

Based on our previous results with the 2QZ, we chose to 
model the OLF $(!/, z) as a double power law in luminosity 
such that 



b, , z 



Expressed in magnitudes this becomes 



(10) 



0.4{a+l)(Afbj-M*^) ^Q0.4{^f+l)(Mbj-M*^) 



10 



.(11) 



The evolution of the luminosity function is given by the 
redshift dependence of the break luminosity = Ll,^{z) 
or magnitude, M^j = M^j {z). In earlier papers in this series. 



we have modelled this evolution as a 2nd-order polynomial 

of the form 



L£,(2) = L£,(0)10'=i^+'==^ , 
or equivalent ly, 

M^^{z) = Mb*j(0) - 2.h{kiz + k2z''). 



(12) 



(13) 



In this paper wc also attempt to fit an exponential evolution 
with look-back time (r) of the form 

L;,(2) = L;,(0)exp(feiT). (14) 
Expressing this in magnitudes it becomes 

{z) = Mbj (0) - 1.08/ciT (15) 
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Table 6. The number of observed, N^bs, and predicted, Np^g^ QSOs for the best fit exponential evolution model in A Universe, along 
with the significance of the devation away from the model, cr. 
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Figure 21. Differential number-magnitude, n(bj), relation for 
the 2QZ (filled circles) and 6QZ (open circles) for QSOs with 
0.4 < z < 2.1 and M^j < —22.5. Also shown is the prediction 
from the best fit exponential evolution model in the A Universe. 

Although such models with an exponential evolution in r 
have previously been shown not to fit the QSO distribution 
(Boyle et al. 2000; La Franca & Cristiani 1997) for an EdS 
Universe, exponential evolution models have not been fitted 
to Universes with non-zero Ha- 
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Figure 22. Observed number-redshift, n(z), relation for the 2QZ 
survey for 0.4 < z < 2.1 and M^j < -22.5. The solid line indi- 
cates the best fit exponential evolution model for the A Universe. 



Those models were fitted to the survey data for both 
EdS and A cosmologies. The statistical errors on individ- 
ual parameters are: cr{a) — 0.05, o-(/3) = 0.1, a(M^_^) = 
0.2, cr(fei)(exponential) = 0.1, cr(fei)(polynomial) = 0.03, 
c(fe2)(polynomial) = 0.01. We note that the size of the 
2QZ/6QZ sample is such that residual systematic uncer- 
tainties are likely to dominate the model fitting. The com- 
pleteness analysis described above (Section 3) is aimed at 
minimizing such uncertainties. Although not affecting the 
shape of the fitted QSO OLF, any error in the survey 6j 
zero-point will translate to a shift in M^^. Checks against 
a small number of independent CCD sequences in the 2QZ 
region suggest that such an error must be no greater than 
0.05 mag. 



© 0000 RAS, MNRAS 000, 000-000 



22 S. M. Groom et al. 



4.5 Results 

We find that both the polynomial evolution in z and the 
exponential evolution in r provide acceptable fits when us- 
ing the A cosmology. The best fit model values are shown in 
Table 5, along with their derived values and probability 
of acceptance. The best fit exponential model using the A 
cosmology has a rate of evolution defined by fei = 6.15. This 
corresponds to an 'e-folding' time of approximately 2Gyr. 
Both evolutionary models provide a poor fit to the full data 
set in an EdS Universe. Although these model fits are re- 
jected, for completeness we also show the best fit model 
parameters for an EdS Universe in Table 5. 

The predicted differential number-magnitude, n{bj), 
and number-redshift, n{z), relations for the 2QZ and 6QZ 
surveys based on the best fit exponential evolution model 
in a A Universe are shown in Fig. 21 and Fig. 22. The 
model predictions provide a good fit to the derived n(bj) 
and observed n{z) relations for QSOs over the redshift range 
0.4 < z < 2.1 and with absolute magnitudes Mbj < —22.5. 
The model also provides a good match to the binned OLF 
(see Fig. 20), although some small diflferences can be seen. In 
Fig. 20 the model points are derived based on the accessible 
volume in each bin (as for the data points). This adjusts the 
model for biases introduced by binning in exactly the same 
way as the data. As a result, the model curves (dotted lines) 
do not trace out perfect double power laws, although they 
are based on the double power law model. 

To investigate any systematic deviations of the data 
away from the model we show the number of objects ob- 
served and predicted, and the significance of the difference 
between the two, a = (A^obs — Npiad)/\/ Np^cd) in Table 6 for 
the exponential model in a A Universe. Positive values of a 
correspond to cases in which there are more QSOs observed 
than predicted. ATpred is only a reasonable estimate of the 
error for large A^prcd- For small A^prcd (< 20) we use the 84.13 
per cent one-sided confidence interval in a Poisson distribu- 
tion to determine the error. As the Poisson distribution is 
not a symmetric function, we choose the error estimate de- 
pending which side of the model expectation the observed 
number of QSOs lies. While not giving the formally correct 
significance for an arbitrary number of standard devations 
(as the Poisson distribution still has a different shape to a 
Gaussian distribution) this method does produce a more re- 
alistic estimate of how significant the deviations from the 
model are. 

It is clear from Fig. 20 and Table 6 that although the 
model is in general good agreement with the data, there are 
some deviations away from the model. The most significant 
of these are at relatively faint magnitudes near the break 
in the OLF (e.g. at Mbj ~ -24.0 in the 1.25 < z < 1.53 
interval). Although the deviations axe small, these bins con- 
tain large numbers of QSOs, making the errors on each bin 
small. It is possible that the simple double power law pa- 
rameterization is not a perfect description of the data when 
sufficient statistical precision is available. It is also possible 
that residual systematic errors are beginning to dominate 
the fitting. As a second goodness of fit test we also calculate 
the 2-D K-S probability for the exponential model in the 
A cosmology, and find a value of 6.4 per cent, making this 
model acceptable at the 2a level and under our pre-defined 
criteria (models should not be rejected at > 99 per cent 
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Figure 23. The normalized cumulative distribution of QSOs in 
each of our six redshift intervals (solid lines) going from low (left) 
to high (right) redshift. These are the raw observed numbers lim- 
ited to Mbj < —22.5 in the A cosmology. Also plotted are the 
predicted cumulative distributions from the best fit exponential 
evolution model (dashed lines) after correcting them for incom- 
pleteness. 



level). At the bright end of the OLF there is some evidence 
of the steepening of the bright end slope towards higher 
redshift found by other authors (e.g. Goldschmidt & Miller 
1998), with the model slightly under- prediciting the number 
of bright low redshift QSOs and similarly over-prediciting 
the numbers of bright high redshift QSOs. By simply exam- 
ining the numbers observed and predicited we find there is a 
2a excess of QSOs in our lowest redshift bin at Mbj < —24.0 
(90 observed vs. 71.49 predicted), and a la deficit of QSOs 
in our highest redshift bin at Afb, < —27.0 (47 observed vs. 
54.72 predicited). We note that these deviations from the 
model are less significant than the those found at fainter 
magnitudes, discussed above. 

To further examine deviations from the best fit model 
we carry out an analysis similar to Goldschmidt & Miller 
(1998). We compare the cumulative distribution of the num- 
ber of QSOs observed and predicted as a function of Mbj for 
our six separate redshift slices. This test has the advantage 
of not relying on comparisons to binned luminosity func- 
tions. Both the data (solid lines) and models (dashed lines) 
are shown in Fig 23. We use the best fit model using the 
exponential evolution in look-back time in a A cosmology. 
Both the model and data are limited to Mb, < —22.5 in or- 
der to compare only the regions fitted. We see that there is 
good general agreement between the model and data. How- 
ever, we do see that the data in the lowest redshift interval 
(0.40 < z < 0.68) contains more bright QSOs than predicted 
by the model. We carry out a ID K-S test comparing the 
model and data in each of our six redshift slices. The re- 
sulting probabilities of the null hypothesis that the data are 
drawn from a distribution described by the model are 0.037, 
0.725, 0.708, 0.068, 0.806, 0.099 for the low to high redshift 
intervals respectively. The largest deviation is found in the 
lowest redshift interval, in which the data deviate from the 
model at the ~ 2a level. This agrees with the excess found 
by examining the numbers of predicted and observed low 
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redshift QSOs at Mbj < —24 and is further evidence of the 
excess of bright low redshift QSOs found by other authors 
(e.g. Goldschmidt & Miller 1998). If we examine the binned 
OLF in Fig. 20, we see that the excess over the model ap- 
pears to occur at intermediate magnitudes, Mbj ~ —24.0 
and brighter. 

We also test the effect of extending the range of our fits 
in both absolute magnitude and redshift. When fitting the 
exponential evolution model in a A cosmology we obtain 
an acceptable fit with a faint limit of Mbj < —22 rather 
than our adopted limit of Mbj < —22.5, with parameters 
virtually indistinguishable from the best fit values shown in 
Table 5 for the Afbj < -22.5 Umit. Extending the redshift 
range of our fits we find that acceptable fits to the expo- 
nential evolution model in a A cosmology arc found in the 
redshift range 0.4 < z < 2.2, but not at higher redshift (e.g. 
0.4 < z < 2.3). Similar results are found with the polynomial 
evolution model, with an acceptable fit at 0.4 < z < 2.2, but 
not with 0.4 < z < 2.3. This is indicative of either (or both) 
a change in the evolution of QSOs or (and) possible resid- 
ual systematic errors in the large completeness corrections 
required at high redshift. 

This appears to demonstrate that a sample of the size 
of the 2QZ is limited by the accuracy of corrections for sys- 
tematic imcompleteness, rather than random noise. It is also 
very likely that although the simple model fits above provide 
a good general description for the evolution of the QSO pop- 
ulation, the actual QSO luminosity function, including its 
evolution, has a more complex form. Evolution in the QSO 
OLF appears to turn over, peaking at a ~ 2 — 3, with de- 
clining QSO activity at higher redshift (e.g. Warren, Howett 
& Osmer 1994; Fan et al. 2001). There are a number of fur- 
ther analyses that could be carried out on the QSO OLF, 
however we leave these to future papers. 

The physical mechanisms which drive the shape and 
evolution of the QSO OLF arc still uncertain. The two power 
law model fits the data well, but the observed break may be 
due to a number of phenomena. The break in the OLF could 
correspond to a break in the black hole mass function, which 
in turn is related to the mass (or velocity dispersion) of the 
host galaxy. Black hole mass is seen to correlate well with 
the velocity dispersion of the spheroidal component of lo- 
cal galaxies (e.g. Gebhardt et al 2000), although this has 
yet to be observed at high redshift. However, the correlation 
between host galaxy and AGN luminosity is less clear with 
only a weak correlation with large scatter found between the 
two (e.g. Schade et al. 2000), at least at low redshift. This 
implies that AGN have a broad range of efficiencies, with 
some accreting near (or above) the Eddington limit and oth- 
ers accreting at a much lower rate. Such a broad range in 
accretion rates convolved with any black hole mass function 
would contrive to significantly blur any break. However, it is 
likely, although by no means certain, that only a small frac- 
tion of sources accrete at super-Eddington rates. This then 
provides a second natural cut off at high luminosity. In this 
context we also need to understand the cause of the strong 
luminosity evolution seen in the QSO population. One view 
is that this is driven by a decline in accretion rate towards 
low redshift, as the available fuel becomes more scarce. A 
second factor, is that possible tiggering events (e.g. merg- 
ers) are more common at high redshift. The current simple 
semi-analytic models of QSO formation (e.g. Kauffmann & 



Haehnelt 2000) predict that black hole masses will be lower 
at high redshift, but that these AGN will be accreting closer 
to the Eddington limit. This implies that for a given lu- 
minosity, black hole masses will be lower at high redshift. 
This model is called into question by the observation of no 
correlation between QSO line widths (which should provide 
an estimate of black hole mass) and redshift (Corbett et 
al. 2003). It is clear that the QSO OLF, along with other 
observations of black hole mass estimates and host galaxy 
properties have the ability to help us understand the under- 
lying physics driving AGN formation and evolution. 

The 2QZ and 6QZ surveys are being made available 
to the astronomical cormnunity. It is hoped that they will 
provide a valuable and lasting resource for astronomers, and 
allow greater insight into the physical processes governing 
AGN as well as enhancing our understanding of cosmology 
and the structure of the Universe at high redshift. 
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